Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.0 MiB |
| Average record size in memory | 309.5 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 14 |
| Categorical | 3 |
avg_monthly_spend is highly overall correlated with spend_credit_score_interaction and 1 other fields | High correlation |
credit_card_limit is highly overall correlated with spend_to_limit_ratio | High correlation |
debt_to_income_ratio is highly overall correlated with loan_balance and 1 other fields | High correlation |
income is highly overall correlated with pca1 | High correlation |
loan_balance is highly overall correlated with debt_to_income_ratio and 1 other fields | High correlation |
loan_balance_log is highly overall correlated with debt_to_income_ratio and 1 other fields | High correlation |
missed_payments is highly overall correlated with pca1 | High correlation |
pca1 is highly overall correlated with income and 1 other fields | High correlation |
spend_credit_score_interaction is highly overall correlated with avg_monthly_spend and 1 other fields | High correlation |
spend_to_limit_ratio is highly overall correlated with avg_monthly_spend and 2 other fields | High correlation |
spend_to_limit_ratio is highly skewed (γ1 = -94.69734502) | Skewed |
debt_to_income_ratio is highly skewed (γ1 = 40.56501671) | Skewed |
customer_id has unique values | Unique |
pca1 has unique values | Unique |
pca2 has unique values | Unique |
spend_to_limit_ratio has unique values | Unique |
debt_to_income_ratio has unique values | Unique |
num_credit_cards has 1340 (13.4%) zeros | Zeros |
missed_payments has 2210 (22.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-14 16:49:34.122681 |
|---|---|
| Analysis finished | 2025-04-14 16:49:48.923688 |
| Duration | 14.8 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
customer_id
Text
Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 644.7 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 10000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | CUST00000 |
|---|---|
| 2nd row | CUST00001 |
| 3rd row | CUST00002 |
| 4th row | CUST00003 |
| 5th row | CUST00004 |
| Value | Count | Frequency (%) |
| cust00000 | 1 | < 0.1% |
| cust00008 | 1 | < 0.1% |
| cust00017 | 1 | < 0.1% |
| cust00002 | 1 | < 0.1% |
| cust00003 | 1 | < 0.1% |
| cust00004 | 1 | < 0.1% |
| cust00005 | 1 | < 0.1% |
| cust00006 | 1 | < 0.1% |
| cust00007 | 1 | < 0.1% |
| cust00009 | 1 | < 0.1% |
| Other values (9990) | 9990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 6 | 4000 | 4.4% |
| 7 | 4000 | 4.4% |
| 3 | 4000 | 4.4% |
| 4 | 4000 | 4.4% |
| 5 | 4000 | 4.4% |
| Other values (4) | 16000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 90000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 6 | 4000 | 4.4% |
| 7 | 4000 | 4.4% |
| 3 | 4000 | 4.4% |
| 4 | 4000 | 4.4% |
| 5 | 4000 | 4.4% |
| Other values (4) | 16000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 90000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 6 | 4000 | 4.4% |
| 7 | 4000 | 4.4% |
| 3 | 4000 | 4.4% |
| 4 | 4000 | 4.4% |
| 5 | 4000 | 4.4% |
| Other values (4) | 16000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 90000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 6 | 4000 | 4.4% |
| 7 | 4000 | 4.4% |
| 3 | 4000 | 4.4% |
| 4 | 4000 | 4.4% |
| 5 | 4000 | 4.4% |
| Other values (4) | 16000 |
age
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.5394 |
| Minimum | 18 |
|---|---|
| Maximum | 69 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 31 |
| median | 43 |
| Q3 | 56 |
| 95-th percentile | 67 |
| Maximum | 69 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.911636 |
|---|---|
| Coefficient of variation (CV) | 0.34248602 |
| Kurtosis | -1.1809278 |
| Mean | 43.5394 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.0020168078 |
| Sum | 435394 |
| Variance | 222.35688 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43 | 227 | 2.3% |
| 62 | 219 | 2.2% |
| 66 | 218 | 2.2% |
| 40 | 216 | 2.2% |
| 34 | 213 | 2.1% |
| 52 | 212 | 2.1% |
| 64 | 211 | 2.1% |
| 45 | 211 | 2.1% |
| 38 | 206 | 2.1% |
| 35 | 205 | 2.1% |
| Other values (42) | 7862 |
| Value | Count | Frequency (%) |
| 18 | 177 | |
| 19 | 201 | |
| 20 | 191 | |
| 21 | 196 | |
| 22 | 189 | |
| 23 | 190 | |
| 24 | 163 | |
| 25 | 192 | |
| 26 | 181 | |
| 27 | 176 |
| Value | Count | Frequency (%) |
| 69 | 181 | |
| 68 | 196 | |
| 67 | 178 | |
| 66 | 218 | |
| 65 | 178 | |
| 64 | 211 | |
| 63 | 172 | |
| 62 | 219 | |
| 61 | 192 | |
| 60 | 167 |
income
Real number (ℝ)
High correlation 
| Distinct | 9986 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59996.199 |
| Minimum | -18448.01 |
|---|---|
| Maximum | 130581.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16 |
| Negative (%) | 0.2% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -18448.01 |
|---|---|
| 5-th percentile | 26952.882 |
| Q1 | 46564.598 |
| median | 59942.99 |
| Q3 | 73532.582 |
| 95-th percentile | 92779.506 |
| Maximum | 130581.1 |
| Range | 149029.11 |
| Interquartile range (IQR) | 26967.985 |
Descriptive statistics
| Standard deviation | 20092.986 |
|---|---|
| Coefficient of variation (CV) | 0.33490431 |
| Kurtosis | 0.035065369 |
| Mean | 59996.199 |
| Median Absolute Deviation (MAD) | 13510.915 |
| Skewness | -0.00043851986 |
| Sum | 5.9996199 × 108 |
| Variance | 4.0372808 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44487.87 | 2 | < 0.1% |
| 62367 | 2 | < 0.1% |
| 52487.33 | 2 | < 0.1% |
| 52659.55 | 2 | < 0.1% |
| 60467.7 | 2 | < 0.1% |
| 63028.15 | 2 | < 0.1% |
| 79888.29 | 2 | < 0.1% |
| 78438.73 | 2 | < 0.1% |
| 62734.93 | 2 | < 0.1% |
| 58965.74 | 2 | < 0.1% |
| Other values (9976) | 9980 |
| Value | Count | Frequency (%) |
| -18448.01 | 1 | |
| -16733.11 | 1 | |
| -13767.31 | 1 | |
| -12021.7 | 1 | |
| -7511.58 | 1 | |
| -6590.08 | 1 | |
| -6422.29 | 1 | |
| -5006.67 | 1 | |
| -4830.28 | 1 | |
| -4651.31 | 1 |
| Value | Count | Frequency (%) |
| 130581.1 | 1 | |
| 128578.21 | 1 | |
| 127555.36 | 1 | |
| 127547.66 | 1 | |
| 125755.22 | 1 | |
| 125714.47 | 1 | |
| 125682.36 | 1 | |
| 123731.5 | 1 | |
| 123155.43 | 1 | |
| 122353.62 | 1 |
employment_status
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 639.8 KiB |
| Employed | |
|---|---|
| Freelancer | |
| Unemployed | |
| Retired |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.5043 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unemployed |
|---|---|
| 2nd row | Freelancer |
| 3rd row | Freelancer |
| 4th row | Unemployed |
| 5th row | Retired |
Common Values
| Value | Count | Frequency (%) |
| Employed | 5962 | |
| Freelancer | 1976 | 19.8% |
| Unemployed | 1051 | 10.5% |
| Retired | 1011 | 10.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| employed | 5962 | |
| freelancer | 1976 | 19.8% |
| unemployed | 1051 | 10.5% |
| retired | 1011 | 10.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 16014 | |
| l | 8989 | |
| d | 8024 | |
| p | 7013 | |
| o | 7013 | |
| y | 7013 | |
| m | 7013 | |
| E | 5962 | 7.0% |
| r | 4963 | 5.8% |
| n | 3027 | 3.6% |
| Other values (7) | 10012 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 85043 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 16014 | |
| l | 8989 | |
| d | 8024 | |
| p | 7013 | |
| o | 7013 | |
| y | 7013 | |
| m | 7013 | |
| E | 5962 | 7.0% |
| r | 4963 | 5.8% |
| n | 3027 | 3.6% |
| Other values (7) | 10012 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 85043 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 16014 | |
| l | 8989 | |
| d | 8024 | |
| p | 7013 | |
| o | 7013 | |
| y | 7013 | |
| m | 7013 | |
| E | 5962 | 7.0% |
| r | 4963 | 5.8% |
| n | 3027 | 3.6% |
| Other values (7) | 10012 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 85043 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 16014 | |
| l | 8989 | |
| d | 8024 | |
| p | 7013 | |
| o | 7013 | |
| y | 7013 | |
| m | 7013 | |
| E | 5962 | 7.0% |
| r | 4963 | 5.8% |
| n | 3027 | 3.6% |
| Other values (7) | 10012 |
credit_card_limit
Real number (ℝ)
High correlation 
| Distinct | 9953 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9972.4282 |
| Minimum | -3396.81 |
|---|---|
| Maximum | 21074.87 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 6 |
| Negative (%) | 0.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -3396.81 |
|---|---|
| 5-th percentile | 5118.9795 |
| Q1 | 7905.7675 |
| median | 9992.105 |
| Q3 | 12022.95 |
| 95-th percentile | 14938.139 |
| Maximum | 21074.87 |
| Range | 24471.68 |
| Interquartile range (IQR) | 4117.1825 |
Descriptive statistics
| Standard deviation | 2990.1251 |
|---|---|
| Coefficient of variation (CV) | 0.29983922 |
| Kurtosis | -0.009431925 |
| Mean | 9972.4282 |
| Median Absolute Deviation (MAD) | 2053.71 |
| Skewness | 0.0076802263 |
| Sum | 99724282 |
| Variance | 8940848.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8707.97 | 2 | < 0.1% |
| 11461.03 | 2 | < 0.1% |
| 11905.39 | 2 | < 0.1% |
| 11930.92 | 2 | < 0.1% |
| 12444.96 | 2 | < 0.1% |
| 7638.56 | 2 | < 0.1% |
| 15081.87 | 2 | < 0.1% |
| 11286.32 | 2 | < 0.1% |
| 13682.38 | 2 | < 0.1% |
| 6837.55 | 2 | < 0.1% |
| Other values (9943) | 9980 |
| Value | Count | Frequency (%) |
| -3396.81 | 1 | |
| -598.45 | 1 | |
| -485.14 | 1 | |
| -360.06 | 1 | |
| -282.73 | 1 | |
| -18.07 | 1 | |
| 389.96 | 1 | |
| 522.74 | 1 | |
| 720.32 | 1 | |
| 749.61 | 1 |
| Value | Count | Frequency (%) |
| 21074.87 | 1 | |
| 20063.72 | 1 | |
| 20044.62 | 1 | |
| 19792.87 | 1 | |
| 19646.12 | 1 | |
| 19476.83 | 1 | |
| 19474.22 | 1 | |
| 19410.06 | 1 | |
| 19396.98 | 1 | |
| 19114.84 | 1 |
num_credit_cards
Real number (ℝ)
Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0002 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 1340 |
| Zeros (%) | 13.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4121612 |
|---|---|
| Coefficient of variation (CV) | 0.70601002 |
| Kurtosis | 0.49196383 |
| Mean | 2.0002 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.70571991 |
| Sum | 20002 |
| Variance | 1.9941994 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 2 | 2694 | |
| 3 | 1812 | |
| 0 | 1340 | |
| 4 | 888 | 8.9% |
| 5 | 376 | 3.8% |
| 6 | 118 | 1.2% |
| 7 | 32 | 0.3% |
| 8 | 7 | 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1340 | |
| 1 | 2730 | |
| 2 | 2694 | |
| 3 | 1812 | |
| 4 | 888 | 8.9% |
| 5 | 376 | 3.8% |
| 6 | 118 | 1.2% |
| 7 | 32 | 0.3% |
| 8 | 7 | 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 7 | 0.1% |
| 7 | 32 | 0.3% |
| 6 | 118 | 1.2% |
| 5 | 376 | 3.8% |
| 4 | 888 | 8.9% |
| 3 | 1812 | |
| 2 | 2694 | |
| 1 | 2730 |
credit_score
Real number (ℝ)
| Distinct | 308 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 680.7964 |
| Minimum | 472 |
|---|---|
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 472 |
|---|---|
| 5-th percentile | 599 |
| Q1 | 646 |
| median | 681 |
| Q3 | 715 |
| 95-th percentile | 763 |
| Maximum | 850 |
| Range | 378 |
| Interquartile range (IQR) | 69 |
Descriptive statistics
| Standard deviation | 50.272319 |
|---|---|
| Coefficient of variation (CV) | 0.073843398 |
| Kurtosis | -0.049016738 |
| Mean | 680.7964 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | 0.00014289622 |
| Sum | 6807964 |
| Variance | 2527.3061 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 684 | 90 | 0.9% |
| 687 | 90 | 0.9% |
| 690 | 89 | 0.9% |
| 681 | 87 | 0.9% |
| 688 | 84 | 0.8% |
| 665 | 84 | 0.8% |
| 666 | 83 | 0.8% |
| 686 | 83 | 0.8% |
| 691 | 82 | 0.8% |
| 676 | 82 | 0.8% |
| Other values (298) | 9146 |
| Value | Count | Frequency (%) |
| 472 | 1 | < 0.1% |
| 494 | 1 | < 0.1% |
| 514 | 1 | < 0.1% |
| 520 | 2 | |
| 521 | 1 | < 0.1% |
| 522 | 1 | < 0.1% |
| 526 | 1 | < 0.1% |
| 528 | 3 | |
| 533 | 1 | < 0.1% |
| 535 | 2 |
| Value | Count | Frequency (%) |
| 850 | 5 | |
| 843 | 1 | < 0.1% |
| 833 | 1 | < 0.1% |
| 832 | 1 | < 0.1% |
| 831 | 1 | < 0.1% |
| 830 | 1 | < 0.1% |
| 829 | 1 | < 0.1% |
| 828 | 2 | < 0.1% |
| 827 | 1 | < 0.1% |
| 825 | 1 | < 0.1% |
loan_balance
Real number (ℝ)
High correlation 
| Distinct | 9975 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14877.974 |
| Minimum | 1.51 |
|---|---|
| Maximum | 176120.56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1.51 |
|---|---|
| 5-th percentile | 747.0815 |
| Q1 | 4134.1125 |
| median | 10335.495 |
| Q3 | 20732.778 |
| 95-th percentile | 44650.295 |
| Maximum | 176120.56 |
| Range | 176119.05 |
| Interquartile range (IQR) | 16598.665 |
Descriptive statistics
| Standard deviation | 14985.761 |
|---|---|
| Coefficient of variation (CV) | 1.0072447 |
| Kurtosis | 6.8351109 |
| Mean | 14877.974 |
| Median Absolute Deviation (MAD) | 7308.465 |
| Skewness | 2.0583806 |
| Sum | 1.4877974 × 108 |
| Variance | 2.2457305 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15211.71 | 2 | < 0.1% |
| 212.75 | 2 | < 0.1% |
| 12434.56 | 2 | < 0.1% |
| 5500.28 | 2 | < 0.1% |
| 2231.51 | 2 | < 0.1% |
| 15596 | 2 | < 0.1% |
| 2390.65 | 2 | < 0.1% |
| 506.68 | 2 | < 0.1% |
| 1147.37 | 2 | < 0.1% |
| 13141.43 | 2 | < 0.1% |
| Other values (9965) | 9980 |
| Value | Count | Frequency (%) |
| 1.51 | 1 | |
| 1.93 | 1 | |
| 2.11 | 1 | |
| 2.44 | 1 | |
| 2.7 | 1 | |
| 5.28 | 1 | |
| 7.24 | 1 | |
| 8.14 | 1 | |
| 9.51 | 1 | |
| 12.64 | 1 |
| Value | Count | Frequency (%) |
| 176120.56 | 1 | |
| 127465.71 | 1 | |
| 124666.25 | 1 | |
| 121852.79 | 1 | |
| 120146.11 | 1 | |
| 118257.58 | 1 | |
| 115592.23 | 1 | |
| 115350.15 | 1 | |
| 111674.53 | 1 | |
| 110958.48 | 1 |
avg_monthly_spend
Real number (ℝ)
High correlation 
| Distinct | 9766 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1800.8887 |
| Minimum | -877.78 |
|---|---|
| Maximum | 3963.35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 15 |
| Negative (%) | 0.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -877.78 |
|---|---|
| 5-th percentile | 792.4695 |
| Q1 | 1399.625 |
| median | 1800.89 |
| Q3 | 2206.735 |
| 95-th percentile | 2822.5975 |
| Maximum | 3963.35 |
| Range | 4841.13 |
| Interquartile range (IQR) | 807.11 |
Descriptive statistics
| Standard deviation | 607.11077 |
|---|---|
| Coefficient of variation (CV) | 0.33711732 |
| Kurtosis | -0.030312913 |
| Mean | 1800.8887 |
| Median Absolute Deviation (MAD) | 403.07 |
| Skewness | -0.0016729499 |
| Sum | 18008887 |
| Variance | 368583.49 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1364.18 | 3 | < 0.1% |
| 1719.72 | 3 | < 0.1% |
| 1884.29 | 3 | < 0.1% |
| 1483.18 | 3 | < 0.1% |
| 1434.7 | 2 | < 0.1% |
| 1747.06 | 2 | < 0.1% |
| 1989.85 | 2 | < 0.1% |
| 2129.81 | 2 | < 0.1% |
| 1680.26 | 2 | < 0.1% |
| 2238.68 | 2 | < 0.1% |
| Other values (9756) | 9976 |
| Value | Count | Frequency (%) |
| -877.78 | 1 | |
| -564.01 | 1 | |
| -285.02 | 1 | |
| -256.19 | 1 | |
| -239 | 1 | |
| -184.43 | 1 | |
| -117.57 | 1 | |
| -109.83 | 1 | |
| -89.9 | 1 | |
| -84.47 | 1 |
| Value | Count | Frequency (%) |
| 3963.35 | 1 | |
| 3863.23 | 1 | |
| 3835.05 | 1 | |
| 3830.24 | 1 | |
| 3713.55 | 1 | |
| 3703.35 | 1 | |
| 3699.43 | 1 | |
| 3682.22 | 1 | |
| 3675.28 | 1 | |
| 3645.4 | 1 |
missed_payments
Real number (ℝ)
High correlation  Zeros 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4909 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2210 |
| Zeros (%) | 22.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2155924 |
|---|---|
| Coefficient of variation (CV) | 0.81534136 |
| Kurtosis | 0.68499405 |
| Mean | 1.4909 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.82812407 |
| Sum | 14909 |
| Variance | 1.477665 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3417 | |
| 2 | 2514 | |
| 0 | 2210 | |
| 3 | 1202 | 12.0% |
| 4 | 479 | 4.8% |
| 5 | 138 | 1.4% |
| 6 | 29 | 0.3% |
| 7 | 10 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2210 | |
| 1 | 3417 | |
| 2 | 2514 | |
| 3 | 1202 | 12.0% |
| 4 | 479 | 4.8% |
| 5 | 138 | 1.4% |
| 6 | 29 | 0.3% |
| 7 | 10 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 10 | 0.1% |
| 6 | 29 | 0.3% |
| 5 | 138 | 1.4% |
| 4 | 479 | 4.8% |
| 3 | 1202 | 12.0% |
| 2 | 2514 | |
| 1 | 3417 | |
| 0 | 2210 |
region
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 605.4 KiB |
| East | |
|---|---|
| West | |
| North | |
| South | |
| Central |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 4.9777 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | West |
|---|---|
| 2nd row | East |
| 3rd row | Central |
| 4th row | West |
| 5th row | South |
Common Values
| Value | Count | Frequency (%) |
| East | 2063 | |
| West | 2012 | |
| North | 2012 | |
| South | 1987 | |
| Central | 1926 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| east | 2063 | |
| west | 2012 | |
| north | 2012 | |
| south | 1987 | |
| central | 1926 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 10000 | |
| s | 4075 | |
| o | 3999 | 8.0% |
| h | 3999 | 8.0% |
| a | 3989 | 8.0% |
| e | 3938 | 7.9% |
| r | 3938 | 7.9% |
| E | 2063 | 4.1% |
| W | 2012 | 4.0% |
| N | 2012 | 4.0% |
| Other values (5) | 9752 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 49777 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 10000 | |
| s | 4075 | |
| o | 3999 | 8.0% |
| h | 3999 | 8.0% |
| a | 3989 | 8.0% |
| e | 3938 | 7.9% |
| r | 3938 | 7.9% |
| E | 2063 | 4.1% |
| W | 2012 | 4.0% |
| N | 2012 | 4.0% |
| Other values (5) | 9752 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 49777 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 10000 | |
| s | 4075 | |
| o | 3999 | 8.0% |
| h | 3999 | 8.0% |
| a | 3989 | 8.0% |
| e | 3938 | 7.9% |
| r | 3938 | 7.9% |
| E | 2063 | 4.1% |
| W | 2012 | 4.0% |
| N | 2012 | 4.0% |
| Other values (5) | 9752 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 49777 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 10000 | |
| s | 4075 | |
| o | 3999 | 8.0% |
| h | 3999 | 8.0% |
| a | 3989 | 8.0% |
| e | 3938 | 7.9% |
| r | 3938 | 7.9% |
| E | 2063 | 4.1% |
| W | 2012 | 4.0% |
| N | 2012 | 4.0% |
| Other values (5) | 9752 |
loan_balance_log
Real number (ℝ)
High correlation 
| Distinct | 9975 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.019635 |
| Minimum | 0.92028275 |
|---|---|
| Maximum | 12.07893 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0.92028275 |
|---|---|
| 5-th percentile | 6.6175116 |
| Q1 | 8.3272698 |
| median | 9.2434361 |
| Q3 | 9.9395194 |
| 95-th percentile | 10.706639 |
| Maximum | 12.07893 |
| Range | 11.158647 |
| Interquartile range (IQR) | 1.6122496 |
Descriptive statistics
| Standard deviation | 1.2975799 |
|---|---|
| Coefficient of variation (CV) | 0.14386169 |
| Kurtosis | 2.259858 |
| Mean | 9.019635 |
| Median Absolute Deviation (MAD) | 0.78164912 |
| Skewness | -1.1307792 |
| Sum | 90196.35 |
| Variance | 1.6837137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.629886542 | 2 | < 0.1% |
| 5.364807108 | 2 | < 0.1% |
| 9.428315389 | 2 | < 0.1% |
| 8.612736071 | 2 | < 0.1% |
| 7.710881792 | 2 | < 0.1% |
| 9.654833867 | 2 | < 0.1% |
| 7.779738783 | 2 | < 0.1% |
| 6.229851328 | 2 | < 0.1% |
| 7.046098825 | 2 | < 0.1% |
| 9.483601206 | 2 | < 0.1% |
| Other values (9965) | 9980 |
| Value | Count | Frequency (%) |
| 0.9202827531 | 1 | |
| 1.075002423 | 1 | |
| 1.134622726 | 1 | |
| 1.235471471 | 1 | |
| 1.30833282 | 1 | |
| 1.83736998 | 1 | |
| 2.109000344 | 1 | |
| 2.212660385 | 1 | |
| 2.352327185 | 1 | |
| 2.613006652 | 1 |
| Value | Count | Frequency (%) |
| 12.07892972 | 1 | |
| 11.75561051 | 1 | |
| 11.73340347 | 1 | |
| 11.71057716 | 1 | |
| 11.69647219 | 1 | |
| 11.68062886 | 1 | |
| 11.65783267 | 1 | |
| 11.65573623 | 1 | |
| 11.62335289 | 1 | |
| 11.61692037 | 1 |
pca1
Real number (ℝ)
High correlation  Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -7.8159701 × 10-18 |
| Minimum | -4.1748279 |
|---|---|
| Maximum | 4.2730381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5090 |
| Negative (%) | 50.9% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -4.1748279 |
|---|---|
| 5-th percentile | -1.6199741 |
| Q1 | -0.69604755 |
| median | -0.023379362 |
| Q3 | 0.68756774 |
| 95-th percentile | 1.7024509 |
| Maximum | 4.2730381 |
| Range | 8.447866 |
| Interquartile range (IQR) | 1.3836153 |
Descriptive statistics
| Standard deviation | 1.0183696 |
|---|---|
| Coefficient of variation (CV) | -1.3029343 × 1017 |
| Kurtosis | 0.14564389 |
| Mean | -7.8159701 × 10-18 |
| Median Absolute Deviation (MAD) | 0.69020032 |
| Skewness | 0.12961845 |
| Sum | 7.1054274 × 10-15 |
| Variance | 1.0370766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.210173851 | 1 | < 0.1% |
| -0.005025252127 | 1 | < 0.1% |
| -0.5225779145 | 1 | < 0.1% |
| 0.6381900492 | 1 | < 0.1% |
| 1.596159711 | 1 | < 0.1% |
| -0.958760143 | 1 | < 0.1% |
| 1.042856183 | 1 | < 0.1% |
| -1.810144415 | 1 | < 0.1% |
| -0.7222099121 | 1 | < 0.1% |
| -1.357563637 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -4.174827876 | 1 | |
| -3.714249239 | 1 | |
| -3.68035998 | 1 | |
| -3.533007898 | 1 | |
| -3.351111675 | 1 | |
| -3.322100361 | 1 | |
| -3.306036454 | 1 | |
| -3.301199625 | 1 | |
| -3.271621491 | 1 | |
| -3.147857749 | 1 |
| Value | Count | Frequency (%) |
| 4.27303808 | 1 | |
| 4.03271911 | 1 | |
| 3.93765415 | 1 | |
| 3.795168638 | 1 | |
| 3.790700504 | 1 | |
| 3.698110751 | 1 | |
| 3.584267974 | 1 | |
| 3.553059477 | 1 | |
| 3.483948061 | 1 | |
| 3.448801841 | 1 |
pca2
Real number (ℝ)
Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1.6342483 × 10-17 |
| Minimum | -3.8935813 |
|---|---|
| Maximum | 4.1699017 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5063 |
| Negative (%) | 50.6% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -3.8935813 |
|---|---|
| 5-th percentile | -1.6136636 |
| Q1 | -0.68003503 |
| median | -0.015310016 |
| Q3 | 0.65609536 |
| 95-th percentile | 1.6925444 |
| Maximum | 4.1699017 |
| Range | 8.063483 |
| Interquartile range (IQR) | 1.3361304 |
Descriptive statistics
| Standard deviation | 1.0127757 |
|---|---|
| Coefficient of variation (CV) | -6.1971957 × 1016 |
| Kurtosis | 0.20130346 |
| Mean | -1.6342483 × 10-17 |
| Median Absolute Deviation (MAD) | 0.6688737 |
| Skewness | 0.12458755 |
| Sum | -1.4210855 × 10-13 |
| Variance | 1.0257145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.728224002 | 1 | < 0.1% |
| -0.2737628105 | 1 | < 0.1% |
| -2.947089522 | 1 | < 0.1% |
| 1.560447233 | 1 | < 0.1% |
| 0.2203495728 | 1 | < 0.1% |
| 0.3427847577 | 1 | < 0.1% |
| -0.06437472275 | 1 | < 0.1% |
| 0.598457478 | 1 | < 0.1% |
| 0.9183522908 | 1 | < 0.1% |
| -0.3313555229 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -3.893581329 | 1 | |
| -3.566164472 | 1 | |
| -3.374005363 | 1 | |
| -3.142957747 | 1 | |
| -3.142750545 | 1 | |
| -3.129998414 | 1 | |
| -3.118694452 | 1 | |
| -3.056756864 | 1 | |
| -3.04136719 | 1 | |
| -3.024853333 | 1 |
| Value | Count | Frequency (%) |
| 4.169901692 | 1 | |
| 3.891479469 | 1 | |
| 3.826228206 | 1 | |
| 3.628458344 | 1 | |
| 3.619316327 | 1 | |
| 3.603268093 | 1 | |
| 3.568688902 | 1 | |
| 3.495234507 | 1 | |
| 3.441559415 | 1 | |
| 3.426973589 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 2991 | |
| 0 | 2895 | |
| 1 | 2822 | |
| 3 | 1292 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 2991 | |
| 0 | 2895 | |
| 1 | 2822 | |
| 3 | 1292 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2991 | |
| 0 | 2895 | |
| 1 | 2822 | |
| 3 | 1292 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2991 | |
| 0 | 2895 | |
| 1 | 2822 | |
| 3 | 1292 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2991 | |
| 0 | 2895 | |
| 1 | 2822 | |
| 3 | 1292 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2991 | |
| 0 | 2895 | |
| 1 | 2822 | |
| 3 | 1292 |
spend_to_limit_ratio
Real number (ℝ)
High correlation  Skewed  Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.19427624 |
| Minimum | -86.49092 |
|---|---|
| Maximum | 4.3803714 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 21 |
| Negative (%) | 0.2% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -86.49092 |
|---|---|
| 5-th percentile | 0.073709686 |
| Q1 | 0.13254135 |
| median | 0.17905158 |
| Q3 | 0.24505953 |
| 95-th percentile | 0.39514019 |
| Maximum | 4.3803714 |
| Range | 90.871291 |
| Interquartile range (IQR) | 0.11251818 |
Descriptive statistics
| Standard deviation | 0.88294653 |
|---|---|
| Coefficient of variation (CV) | 4.5447993 |
| Kurtosis | 9294.712 |
| Mean | 0.19427624 |
| Median Absolute Deviation (MAD) | 0.054050016 |
| Skewness | -94.697345 |
| Sum | 1942.7624 |
| Variance | 0.77959457 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.2373240139 | 1 | < 0.1% |
| 0.1989202288 | 1 | < 0.1% |
| 0.1948834084 | 1 | < 0.1% |
| 0.2263309198 | 1 | < 0.1% |
| 0.4613531079 | 1 | < 0.1% |
| 0.1069009247 | 1 | < 0.1% |
| 0.1154605386 | 1 | < 0.1% |
| 0.03877919054 | 1 | < 0.1% |
| 0.09896099179 | 1 | < 0.1% |
| 0.2187432348 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -86.49091974 | 1 | |
| -7.087424129 | 1 | |
| -4.315260049 | 1 | |
| -2.42673027 | 1 | |
| -0.6929204033 | 1 | |
| -0.6785950922 | 1 | |
| -0.08343046992 | 1 | |
| -0.05935228935 | 1 | |
| -0.024309127 | 1 | |
| -0.02203559071 | 1 |
| Value | Count | Frequency (%) |
| 4.380371393 | 1 | |
| 3.151067323 | 1 | |
| 3.144887151 | 1 | |
| 2.474894626 | 1 | |
| 2.457419059 | 1 | |
| 2.297622326 | 1 | |
| 2.263068403 | 1 | |
| 2.096018203 | 1 | |
| 2.005036387 | 1 | |
| 1.918373057 | 1 |
debt_to_income_ratio
Real number (ℝ)
High correlation  Skewed  Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.32352152 |
| Minimum | -21.856784 |
|---|---|
| Maximum | 93.567037 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16 |
| Negative (%) | 0.2% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -21.856784 |
|---|---|
| 5-th percentile | 0.011474555 |
| Q1 | 0.070354839 |
| median | 0.17728227 |
| Q3 | 0.37045131 |
| 95-th percentile | 0.92215009 |
| Maximum | 93.567037 |
| Range | 115.42382 |
| Interquartile range (IQR) | 0.30009647 |
Descriptive statistics
| Standard deviation | 1.5056987 |
|---|---|
| Coefficient of variation (CV) | 4.6540914 |
| Kurtosis | 2115.3613 |
| Mean | 0.32352152 |
| Median Absolute Deviation (MAD) | 0.12873978 |
| Skewness | 40.565017 |
| Sum | 3235.2152 |
| Variance | 2.2671287 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3231938624 | 1 | < 0.1% |
| 0.04633241708 | 1 | < 0.1% |
| 0.01781460424 | 1 | < 0.1% |
| 0.4353905014 | 1 | < 0.1% |
| 0.09195809431 | 1 | < 0.1% |
| 0.1885049258 | 1 | < 0.1% |
| 0.1516143939 | 1 | < 0.1% |
| 0.3461478053 | 1 | < 0.1% |
| 0.2278953553 | 1 | < 0.1% |
| 0.002311830616 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -21.85678392 | 1 | |
| -13.02126372 | 1 | |
| -9.710478476 | 1 | |
| -7.706448239 | 1 | |
| -3.161832038 | 1 | |
| -1.630646846 | 1 | |
| -1.321868865 | 1 | |
| -1.285877694 | 1 | |
| -1.269276165 | 1 | |
| -1.072802633 | 1 |
| Value | Count | Frequency (%) |
| 93.56703712 | 1 | |
| 64.6786197 | 1 | |
| 58.53316604 | 1 | |
| 42.53090543 | 1 | |
| 29.94576354 | 1 | |
| 21.39039747 | 1 | |
| 14.4134637 | 1 | |
| 13.30860258 | 1 | |
| 12.62925574 | 1 | |
| 12.30746968 | 1 |
spend_credit_score_interaction
Real number (ℝ)
High correlation 
| Distinct | 9997 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1225995.4 |
| Minimum | -596012.62 |
|---|---|
| Maximum | 2855282.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 15 |
| Negative (%) | 0.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -596012.62 |
|---|---|
| 5-th percentile | 530102.56 |
| Q1 | 942453.62 |
| median | 1220242.5 |
| Q3 | 1505750.7 |
| 95-th percentile | 1938993.9 |
| Maximum | 2855282.9 |
| Range | 3451295.5 |
| Interquartile range (IQR) | 563297.06 |
Descriptive statistics
| Standard deviation | 424059.82 |
|---|---|
| Coefficient of variation (CV) | 0.34589021 |
| Kurtosis | 0.021863137 |
| Mean | 1225995.4 |
| Median Absolute Deviation (MAD) | 282677.19 |
| Skewness | 0.088409313 |
| Sum | 1.2259954 × 1010 |
| Variance | 1.7982673 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1414466.08 | 2 | < 0.1% |
| 1118288.64 | 2 | < 0.1% |
| 1359400.23 | 2 | < 0.1% |
| 1658287.26 | 1 | < 0.1% |
| 771675.82 | 1 | < 0.1% |
| 915731.52 | 1 | < 0.1% |
| 1039820.76 | 1 | < 0.1% |
| 1630971.5 | 1 | < 0.1% |
| 1120411.95 | 1 | < 0.1% |
| 1382406.72 | 1 | < 0.1% |
| Other values (9987) | 9987 |
| Value | Count | Frequency (%) |
| -596012.62 | 1 | |
| -358146.35 | 1 | |
| -200654.08 | 1 | |
| -174465.39 | 1 | |
| -152004 | 1 | |
| -119326.21 | 1 | |
| -77713.77 | 1 | |
| -76222.02 | 1 | |
| -64281.67 | 1 | |
| -61581.5 | 1 |
| Value | Count | Frequency (%) |
| 2855282.85 | 1 | |
| 2812313.49 | 1 | |
| 2774149.4 | 1 | |
| 2757400.95 | 1 | |
| 2737130.32 | 1 | |
| 2714631.36 | 1 | |
| 2691449.76 | 1 | |
| 2557994.88 | 1 | |
| 2531788.64 | 1 | |
| 2530415.65 | 1 |
Interactions
Correlations
| age | avg_monthly_spend | cluster | credit_card_limit | credit_score | debt_to_income_ratio | employment_status | income | loan_balance | loan_balance_log | missed_payments | num_credit_cards | pca1 | pca2 | region | spend_credit_score_interaction | spend_to_limit_ratio | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | -0.005 | 0.048 | -0.005 | -0.005 | -0.001 | 0.009 | -0.002 | -0.005 | -0.005 | 0.005 | -0.003 | 0.144 | 0.285 | 0.000 | -0.004 | -0.001 |
| avg_monthly_spend | -0.005 | 1.000 | 0.314 | 0.018 | -0.001 | -0.002 | 0.000 | -0.015 | -0.005 | -0.005 | 0.002 | 0.013 | 0.283 | -0.464 | 0.008 | 0.973 | 0.719 |
| cluster | 0.048 | 0.314 | 1.000 | 0.365 | 0.153 | 0.177 | 0.000 | 0.223 | 0.486 | 0.381 | 0.174 | 0.104 | 0.305 | 0.232 | 0.009 | 0.345 | 0.000 |
| credit_card_limit | -0.005 | 0.018 | 0.365 | 1.000 | -0.002 | 0.000 | 0.015 | 0.018 | 0.006 | 0.006 | 0.004 | -0.009 | 0.018 | -0.237 | 0.000 | 0.018 | -0.624 |
| credit_score | -0.005 | -0.001 | 0.153 | -0.002 | 1.000 | 0.017 | 0.000 | -0.012 | 0.017 | 0.017 | 0.015 | -0.003 | 0.322 | 0.491 | 0.000 | 0.205 | 0.003 |
| debt_to_income_ratio | -0.001 | -0.002 | 0.177 | 0.000 | 0.017 | 1.000 | 0.000 | -0.279 | 0.942 | 0.942 | -0.019 | -0.006 | -0.096 | 0.338 | 0.012 | 0.003 | -0.001 |
| employment_status | 0.009 | 0.000 | 0.000 | 0.015 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.006 | 0.010 | 0.019 | 0.008 | 0.000 | 0.000 | 0.000 | 0.000 |
| income | -0.002 | -0.015 | 0.223 | 0.018 | -0.012 | -0.279 | 0.000 | 1.000 | 0.001 | 0.001 | -0.019 | -0.007 | -0.534 | -0.099 | 0.000 | -0.017 | -0.021 |
| loan_balance | -0.005 | -0.005 | 0.486 | 0.006 | 0.017 | 0.942 | 0.000 | 0.001 | 1.000 | 1.000 | -0.027 | -0.009 | -0.266 | 0.330 | 0.004 | 0.000 | -0.007 |
| loan_balance_log | -0.005 | -0.005 | 0.381 | 0.006 | 0.017 | 0.942 | 0.006 | 0.001 | 1.000 | 1.000 | -0.027 | -0.009 | -0.266 | 0.330 | 0.002 | 0.000 | -0.007 |
| missed_payments | 0.005 | 0.002 | 0.174 | 0.004 | 0.015 | -0.019 | 0.010 | -0.019 | -0.027 | -0.027 | 1.000 | -0.006 | 0.566 | 0.195 | 0.000 | 0.004 | 0.001 |
| num_credit_cards | -0.003 | 0.013 | 0.104 | -0.009 | -0.003 | -0.006 | 0.019 | -0.007 | -0.009 | -0.009 | -0.006 | 1.000 | 0.262 | -0.416 | 0.000 | 0.013 | 0.022 |
| pca1 | 0.144 | 0.283 | 0.305 | 0.018 | 0.322 | -0.096 | 0.008 | -0.534 | -0.266 | -0.266 | 0.566 | 0.262 | 1.000 | 0.012 | 0.011 | 0.345 | 0.200 |
| pca2 | 0.285 | -0.464 | 0.232 | -0.237 | 0.491 | 0.338 | 0.000 | -0.099 | 0.330 | 0.330 | 0.195 | -0.416 | 0.012 | 1.000 | 0.000 | -0.344 | -0.192 |
| region | 0.000 | 0.008 | 0.009 | 0.000 | 0.000 | 0.012 | 0.000 | 0.000 | 0.004 | 0.002 | 0.000 | 0.000 | 0.011 | 0.000 | 1.000 | 0.014 | 0.000 |
| spend_credit_score_interaction | -0.004 | 0.973 | 0.345 | 0.018 | 0.205 | 0.003 | 0.000 | -0.017 | 0.000 | 0.000 | 0.004 | 0.013 | 0.345 | -0.344 | 0.014 | 1.000 | 0.701 |
| spend_to_limit_ratio | -0.001 | 0.719 | 0.000 | -0.624 | 0.003 | -0.001 | 0.000 | -0.021 | -0.007 | -0.007 | 0.001 | 0.022 | 0.200 | -0.192 | 0.000 | 0.701 | 1.000 |
Missing values
Sample
| customer_id | age | income | employment_status | credit_card_limit | num_credit_cards | credit_score | loan_balance | avg_monthly_spend | missed_payments | region | loan_balance_log | pca1 | pca2 | cluster | spend_to_limit_ratio | debt_to_income_ratio | spend_credit_score_interaction | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CUST00000 | 56 | 42952.26 | Unemployed | 9026.70 | 0 | 774.0 | 13882.23 | 2142.49 | 2 | West | 9.538437 | 1.210174 | 1.728224 | 1 | 0.237324 | 0.323194 | 1658287.26 |
| 1 | CUST00001 | 69 | 69507.31 | Freelancer | 16004.73 | 1 | 754.0 | 6338.18 | 1373.80 | 1 | East | 8.754505 | 0.049432 | 0.999300 | 0 | 0.085832 | 0.091186 | 1035845.20 |
| 2 | CUST00002 | 46 | 72649.08 | Freelancer | 9410.59 | 6 | 720.0 | 13418.14 | 1397.71 | 0 | Central | 9.504437 | -0.161978 | -0.767478 | 2 | 0.148509 | 0.184695 | 1006351.20 |
| 3 | CUST00003 | 32 | 50516.44 | Unemployed | 12264.08 | 2 | 728.0 | 35450.65 | 2103.35 | 3 | West | 10.475925 | 0.909416 | 0.663126 | 3 | 0.171491 | 0.701751 | 1531238.80 |
| 4 | CUST00004 | 60 | 44564.56 | Retired | 12156.74 | 4 | 802.0 | 15181.27 | 1528.36 | 0 | South | 9.627884 | 0.909492 | 0.778636 | 1 | 0.125711 | 0.340650 | 1225744.72 |
| 5 | CUST00005 | 25 | 92190.41 | Unemployed | 6459.93 | 2 | 719.0 | 27489.93 | 1670.70 | 2 | Central | 10.221611 | -0.886300 | 0.672202 | 2 | 0.258585 | 0.298183 | 1201233.30 |
| 6 | CUST00006 | 38 | 64549.92 | Retired | 4463.82 | 0 | 647.0 | 6338.24 | 1736.66 | 0 | North | 8.754514 | -1.387904 | 0.172505 | 2 | 0.388965 | 0.098190 | 1123619.02 |
| 7 | CUST00007 | 56 | 72719.36 | Employed | 12462.95 | 1 | 741.0 | 5186.24 | 1334.91 | 4 | East | 8.553957 | 1.169375 | 1.398565 | 0 | 0.107102 | 0.071318 | 989168.31 |
| 8 | CUST00008 | 36 | 43558.48 | Employed | 6749.36 | 1 | 676.0 | 4432.63 | 1489.23 | 0 | Central | 8.396974 | -0.531246 | 0.174823 | 2 | 0.220615 | 0.101760 | 1006719.48 |
| 9 | CUST00009 | 40 | 35614.31 | Employed | 5007.45 | 5 | 704.0 | 10136.99 | 1789.37 | 1 | Central | 9.224045 | 1.169690 | -0.415433 | 1 | 0.357270 | 0.284625 | 1259716.48 |
| customer_id | age | income | employment_status | credit_card_limit | num_credit_cards | credit_score | loan_balance | avg_monthly_spend | missed_payments | region | loan_balance_log | pca1 | pca2 | cluster | spend_to_limit_ratio | debt_to_income_ratio | spend_credit_score_interaction | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | CUST09990 | 44 | 61787.21 | Unemployed | 7620.69 | 1 | 685.0 | 2269.03 | 1541.52 | 0 | South | 7.727548 | -0.806224 | 0.159596 | 2 | 0.202254 | 0.036723 | 1055941.20 |
| 9991 | CUST09991 | 35 | 79691.40 | Employed | 4682.80 | 3 | 658.0 | 1026.66 | 1796.56 | 0 | South | 6.935040 | -1.031192 | -0.962547 | 2 | 0.383569 | 0.012883 | 1182136.48 |
| 9992 | CUST09992 | 41 | 72282.12 | Employed | 10546.25 | 2 | 708.0 | 4242.44 | 1703.00 | 3 | South | 8.353130 | 0.717278 | 0.163800 | 0 | 0.161464 | 0.058692 | 1205724.00 |
| 9993 | CUST09993 | 54 | 52803.19 | Freelancer | 11137.48 | 0 | 693.0 | 19096.12 | 2216.89 | 2 | West | 9.857293 | 0.347880 | 0.741722 | 0 | 0.199030 | 0.361640 | 1536304.77 |
| 9994 | CUST09994 | 41 | 77917.02 | Employed | 7025.31 | 1 | 673.0 | 6970.02 | 1629.36 | 3 | North | 8.849517 | 0.034828 | 0.509900 | 2 | 0.231894 | 0.089453 | 1096559.28 |
| 9995 | CUST09995 | 55 | 61829.60 | Employed | 8399.68 | 0 | 773.0 | 50297.34 | 1087.02 | 2 | East | 10.825727 | -0.545960 | 3.426974 | 3 | 0.129397 | 0.813470 | 840266.46 |
| 9996 | CUST09996 | 51 | 57877.36 | Employed | 14626.40 | 2 | 660.0 | 5879.15 | 1995.95 | 3 | South | 8.679338 | 1.019767 | -0.570666 | 0 | 0.136453 | 0.101578 | 1317327.00 |
| 9997 | CUST09997 | 57 | 50476.02 | Unemployed | 13478.49 | 1 | 728.0 | 2441.33 | 1723.67 | 5 | South | 7.800708 | 2.411201 | 1.108859 | 0 | 0.127874 | 0.048365 | 1254831.76 |
| 9998 | CUST09998 | 64 | 63320.38 | Employed | 5546.73 | 3 | 687.0 | 2914.59 | 1381.68 | 1 | North | 7.977827 | 0.125488 | 0.427576 | 2 | 0.249053 | 0.046029 | 949214.16 |
| 9999 | CUST09999 | 32 | 79929.25 | Freelancer | 9746.09 | 4 | 687.0 | 10054.76 | 2187.70 | 3 | Central | 9.215901 | 0.784293 | -1.017262 | 0 | 0.224446 | 0.125794 | 1502949.90 |